Using Semantics and Statistics to Turn Data into Knowledge

نویسندگان

  • Jay Pujara
  • Hui Miao
  • Lise Getoor
  • William W. Cohen
چکیده

SPRING 2015 65 Agrowing body of research focuses on extracting knowledge from text such as news reports, encyclopedic articles, and scholarly research in specialized domains. Much of this data is freely available on the World Wide Web and harnessing the knowledge contained in millions of web documents remains a problem of particular interest. The scale and diversity of this content pose a formidable challenge for systems designed to extract this knowledge. Many well-known broad domain and open information-extraction systems seek to build knowledge bases from text, including the Never-Ending Language Learning (NELL) project (Carlson et al. 2010), OpenIE (Etzioni et al. 2008), DeepDive (Niu et al. 2012), and efforts at Google (Pasca et al. 2006). Ultimately, these information-extraction systems produce a collection of candidate facts that include a set of entities, attributes of these entities, and the relations between these entities. Information-extraction systems use a sophisticated collection of strategies to generate candidate facts from web documents, spanning the syntactic, lexical, and structural features of text (Weikum and Theobald 2010, Wimalasuriya and Dou 2010). Although these systems are capable of extracting many candidate facts from the web, their output is often hampered by noise. Documents contain inaccurate, outdat-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

نظریه پردازی بر فرآیند انتقال دانش نظری به حوزه عمل در پرستاری: رویکرد گراندد تئوری

Introduction & Objective: Knowledge transfer and in fact, the bridging of theory and practice is one of the main concerns of all academic disciplines. Getting prominent professional status is the thing that can be achieved by knowledge-based function, and of which would be called as successful discipline that it be able to transfer its theoretical paradigmatic claims into practice. Accordingly,...

متن کامل

End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Language Understanding

Spoken language understanding (SLU) is a core component of a spoken dialogue system. In the traditional architecture of dialogue systems, the SLU component treats each utterance independent of each other, and then the following components aggregate the multi-turn information in the separate phases. However, there are two challenges: 1) errors from previous turns may be propagated and then degra...

متن کامل

Analyzing the problem of meaning in Shabastari’s Golshane Raz

Man has always been finding a complete model for semantics since the beginning. A model which can as a paradigm affects all branches of sciences. In the view of author, such a model can be found in Golshane Raz. Introducing the model from the work mentioned, the paper has tried to explain its sub structural foundations in three fields of ontology, epistemology and semantics. Some of the foundat...

متن کامل

Declarative Semantics in Object-Oriented Software Development - A Taxonomy and Survey

One of the modern paradigms to develop an application is object oriented analysis and design. In this paradigm, there are several objects and each object plays some specific roles in applications. In an application, we must distinguish between procedural semantics and declarative semantics for their implementation in a specific programming language. For the procedural semantics, we can write a ...

متن کامل

Interrogation of a University Classrooms in the Court of Semantics: Managerial Implications

The purpose of this article, within the framework of an interpretive study, was to study the semantics of a universitychr('39')s classrooms to create a critical awareness of the meanings of the symptoms and their functions at the context of physical artifacts, besides their managerial implications. To accomplish this goal, after taking pictures of the structural elements of the studied classroo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • AI Magazine

دوره 36  شماره 

صفحات  -

تاریخ انتشار 2015